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Abstract 

In this paper, we develop a Markovian model that deals with the volume offered at the best 
quote of an electronic order book. The volume of the first limit is a stochastic process whose 
paths are periodically interrupted and reset to a new value, either by a new limit order submitted 
inside the spread or by a market order that removes the first limit. Using applied probability 
results on killing and resurrecting Markov processes, we derive the stationary distribution of 
the volume offered at the best quote. All proposed models are empirically fitted and compared, 
stressing the importance of the proposed mechanisms. 


1 Introduction 


The limit order book has recently been the subject of a growing interest among academics and prac¬ 
titioners studying financial markets. This electronic structure centralizes all the orders submitted 
to a given market by all participants, finds matchable buy and sell orders, and therefore defines 
the price of the financial product exchanged. The fundamental question is thus to understand 
how the sequence s of submitted orders - the orders flows - are translated into price dynamics. 


Biais et al. 


(199a) and 


Bouc. 


laud et al. 


ties of the limit order book. 


Smith et al. 


{ 2OO2II a re pioneer investigations on the empirical proper- 


(|2003l l is a pioneer theoretical framework for the study 


of the continuous double auction that is used in limit order books. 
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The order book is a complex system. Basic mathematical models, such as 


Cont et al. 


(|20ldb 


rely on simplifying assumptions: only three main types of orders are submitted (limit, market 
and cancellations, ignoring exchange-specifi c rules and specificities) ; all orders flows are Poisson 
processes ; all orders have the same unit size. lAbergel &: Jedidil (j2013l i shows that under appropriate 
assumptions, some limit t heore ms apply and such Markovian models lead to a diffusion equation 
for the price. iMuni Tokel (j2014l l shows that with similar Poisson models, the average shape of the 


order book is analytically computable, even when relaxing the unit-volume hypothesis. 

Among the important quantities that describe a limit order book, the volume offered at the 
best quote (bid or ask), i.e. the total number of shares available at the first limit of the order 
book, is fundamental. One reason is that this quantity is often the only information easily available 
to market participants (known as Level-1 data). Another (linked) reason is th a t this quantity is 
heavily used in trading strategies by market participants : iDovne Farmer et al. (j2004l i shows that 
the majority of market orders that move the price have a size exactly equal to the volume of the 
best quote at the time of submission. 

However, the volume of the best quote remains difficult to grasp in a model, since market events 
make it vary widely. When the volume of the best quote drops to zero (because of large market 
orders or cancellations that remove all the liquidity of the best quote), or when a new best quote is 
submitted inside the spread, the price changes, and the volume of the best quote is reset to a new 
quantity that may have no link to the quantities describing the last order submitted. Therefore, 
computing the volume of the best quote after a new event requires to keep track of the whole order 
book. 

Recently, ICont &: De LarrardI (j2012l ) proposes a model in which the order book is restricted 
to its first limits. When the price does not move, the volume at the best quote obviously varies 
according to the arriving orders flows, and when this volume drops to zero, the price moves and 
the volume at the best quote is immediately reset to some random value. Therefore, the volume of 
the best quotes (bid and ask) is a two-dimensional process with values on the positive ortha nt that 
jumps randomly inside the orthant each time it reaches an axis. ICont &: De LarrardI (|2012l j shows 
that under appropriate assumptions and using limit theorems in the spirit of queueing theory, this 
volume may approximately exhibit a jump-diffusive behaviour. There is however a very restrictive 
assumption for this model to be valid: the spread has to always be equal to one tick. Indeed, one 
cannot allow for limit orders submitted inside the spread in this framework, since it would make 
the process jumps even when it does not reach an axis. Such an assumption may be an appropriate 
model for some very busy periods of trading of very liquid stocks, but probably not in the general 
case. 

In this paper, we show that in the classical zero-intelligence order book model with Poisson 
processes, the stationary distribution of the volume offered at the best quote can be analytically 
and then numerically computed. The proposed model is basic but flexible. It does not assume that 
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the spread is always equal to one tick, i.e. all types of events that make the volume of the best quote 
jump are taken into account: aggressive market orders that matches the full first limit, as well as 
aggressive limit orders submitted inside the spread as well. The main idea is that such ju mps of 
the v olume at the best quote can be identified as killing and resurrecting a Markov process (jPakea . 


19971 1. The assumptions that all orders have to be unit-sized can even be lifted, with additional 


restrictions on the volume of market orders. 

The remainder of the paper is organized as follows. Section [2] describes precisely the general 
zero-intelligence model of the order book with Poisson processes that is used here. The main 
result on killing and resurrecting Markov processes is recalled and adapted to the order book 
context. Sections [3] and m then explore two types of restrictive assumptions that allow the analytical 
computation of the stationary distribution of the volume at the best quote: Section [3] excludes 
market orders that partially match the best quote, while Section 0] allows for all types of market 
orders but with some size restrictions. Finally, Section [5] provides empirical fittings of all the 
analytical models proposed, with a comparison to the simulated ’’best effort” of the unrestricted 
Poisson model. 


2 A Markovian model of the one-side order book 

Let us consider the best ask quote of an electronic order book (the model for the best bid is strictly 
identical). The quantity offered at the best quote may be modified by the arrival of market or limit 
orders and by cancellation of existing limit orders. 

Firstly, we assume that limit orders at the best quote are submitted according to a Poisson 
process with rate Ai and that the size of limit orders at the best quote form a set of independent 
and identically distributed random variables with probability distribution (5i,n)nGN- Secondly, we 
also assume that each unit-size component of a limit order (i.e. each share) standing at the best 
quote is cancelled some random time after its submission. All these random times are assumed to 
form a set of independent and identically distributed random variables with exponential distribution 
with parameter 6i > 0. Thirdly, we assume that we observe ’’partial” market orders (the meaning of 
partial will soon be clear), that are submitted according to a Poisson process with rate ^ and are all 
unit-sized (one share). We also assume that the cancellation mechanism and the ’’partial” market 
orders cannot fully deplete the best quote. In other words, the last share of the best quote cannot 
be cancelled, and cannot be matched by a partial market order (hence the name of the ’’partial” 
market orders, which match only partially the best quote). This small assumption ensures that 
classic cancellations and market orders cannot make the price move. 

Two types of orders can make the price move: the aggressive market orders and aggressive limit 
orders. Orders of the first type, aggressive market orders, are submitted according to a Poisson 
process ^a-, and their size is equal to the available quantity at the best quote at the time of the 
submission. In other words, aggressive market orders match all the liquidity available at the best 
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quote. This is not an unreasonable assumption: 


Dovne Farmer et al. 


(2004) shows that on a 16- 


stock sample from the London stock exchange, 86% of the buy market orders that change the price 
have a size that is exactly equal to the volume offered at the best ask. When an aggressive market 
order is submitted, the volume available at the best quote drops to zero, the price moves up and 
the quantity available at the best limit is instantaneously reset to the quantity available at the next 
non-empty limit of the order book. This next non-empty limit of the order book evolves according 
to the submission and cancellation of limit orders. We may already specify, for future use, a model 
that is coherent with the Markovian setting we are establishing. To this end we will assume the 
following: the limit orders arrive according to a Poisson process with rate A 2 ; the sizes of these 
limit orders form a set of independent and identically distributed random variables with probability 
distribution ( 52 ,n)ngN ; each unit-size component of a limit order (i.e. each share) standing at the 
next non-empty limit is cancelled some random time after its submission, and all these random 
times are assumed to form a set of independent and identically distributed random variables with 
exponential distribution with parameter 62 > 0. 

The second type of orders that move the price are the aggressive limit orders. These are 
limit orders that are submitted inside the spread, i.e. at a price lower than the current best ask. 
We will assume that aggressive limit orders are submitted according to a Poisson process with 
rate Aq, and that the size of all aggressive limit orders form a set of independent and identically 
distributed random variables with probability distribution (5o,n)neN- The effect of the submission 
of an aggressive limit order is simple: at the moment of the submission, this order instantly becomes 
the best quote, i.e. the ask price is reset to the price of the aggressive limit order, and the quantity 
available at the best quote is reset to the volume of the submitted limit order. 

In summary, the continuous-time stochastic process X = {X{t),t € [0,oo)} describing the vol¬ 
ume available at the best quote evolves as follows. Let ri be the random time of the first price move. 
During the time interval [0,ri), in the absence of events that move the price (aggressive market 
orders, aggressive limit orders), X evolves as the stochastic process 1 -|- T = {1 -|- T {t),t G [0, 00 )}, 
which is one (the last share that cannot be cancelled or executed) plus the size of a queue with the 
infinitesimal generator: 


( -Al 


Aifl' 1,2 


-*^101 ,4 • 

•• ^ 


— (/r -|- Al -|- 9i) 

-^101,1 

-^101,2 

Ai01,3 • 


0 

/r -|- 201 

— (/i -|- Al -|- 20i) 

-^101,1 

Ai01,2 • 


0 

0 

/r -|- 301 

— (/i -|- Al -|- 30i) 

Ai0i,i . 





Now, at time ri, the price moves because of an aggressive limit or aggressive market order. 
The process X is instantaneously reset to a new random variable X^-^ = H that, depending on 
the direction of the price move, represents either the size of the incoming aggressive limit order 


4 







(downward price move), or the volume offered at the next non-empty limit inside the order book 
(upward price move). Then, if T 2 is the random time of the next price movement, X on [ti,T 2 ) 
behaves according to the infinitesimal generator and so on. 

This mechanism is identifiable to what is known in applied probability as killing and resurrecting 
a Markov process. The process of the volume of the best quote starts at time 0 and evolves according 
to the infinitesimal generator ©• Then, upon the submission of an aggressive limit or market order, 
it is killed, and (instantaneously) resurrected to a random variable H with distribution 
from where it restarts its course according the previous dynamics. Such a mechanism is studied in 
JiqqtI i. where the following result is proved. 


Takes 


Theorem 1 (rephrased from lPakea (119971 1). Let Z = {Zf, t € [0, oo)} be a Markov process on N with 
initial distribution (hi)igN. Zero state is assumed to be absorbing for Z. Let R = {Rt,t G [0, oo)} 
be the process constructed as follows : R starts following some path of the process Z ; at some 
random time (exponentially distributed with parameter fd > t)), R is killed, i.e. reset to 0 ; after 
some random time (exponentially distributed with parameter a > 0), R is resurrected, i.e. restarts 
following a new path of the process Z ; and so on. 

Then the process R admits a stationary distribution (vTj )jgjsj) given by: 


TTo = 


/3 


a + P - aPfoifd) 


TTj = a-Kofjild), 


( 2 ) 


where fj is the Laplace transform of the series hiPij{t), in which pij{t) = P{Z{t) = j\Z{t)) = i) 

iSN 

is the transient probability of the (non-killed) process Z from state i to state j. 

In our order book model, the resurrection is instantaneous, i.e. a —>■ -|-oo ; the killing events 
are aggressive limit and market orders, i.e. /3 = Aq + by standard properties of the Poisson 
processes ; the state 0 is not accessible without killing (classic cancellations and partial market 
orders cannot deplete the best limit), i.e. /o = 0. Another consequence of the last fact is that the 
distribution (hj)jgN* of H represents exactly the new volume available at the best quote after an 
aggressive event. This distribution is therefore a mix of the distribution ((7o,i)iGN* with probability 

Aq 


TA + Ao 
probability 


and the stationary distribution of the volume at the second limit, denoted (vr 2 ,i)ieNj with 
LA 


-. We thus obtain the following result. 


hA + Ao 

Proposition 1. In the general Poisson order book model described in this section, the stationary 
distribution {'n'j)j£n* of the total volume offered at the best quote is written for any j > 1 : 


T^j — (Ao + hA)fji^0 + Ta) 


(3) 
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where fj{-) is the Laplaee transform of the series Kpi^j(t) in which Vi € N*, 

ieN 

, ^090,i , hA'^2,i 

hi = T - ^- + T-^-, (4) 

^0 + hA ^0 + hA 

and Pijft) = P(1 + Y{t) = j\l + y(0) = i) is the transient probability of the (non-killed) process 
1 + y from state i to state j . 

This result is central in this work. In the following, we will study several specifications of the 
above general model, all of them allowing analytical tractability at some cost. The first criterion 
dividing the different types of models is the presence or absence of ’’partial” market models. In 
the first type of model (Type-1 models), we assume that all market orders are aggressive market 
orders. In other words /U = 0, and there are no ’’partial” market orders that match only partially 
the best limit. In the second type of models (Type-2 Models), this restriction is lifted, i.e. market 
orders may or may not be aggressive (/x > 0), but some restrictions on the distributions of the 
volumes will be added. These different types of models are studied in the next two sections. 

3 Models with aggressive market orders exclusively 

Type-1 models assume that all market orders are aggressive, i.e. that /x = 0. In this setting, we 
are able to analytically compute the stationary distribution tt of the process X through a direct 
approach; we compute the transient probabilities {pij{t))ij£^ by a standard generating function 
method, as well as the stationary distribution (7r2,i)igN* by a direct method. 

Let us start with the transient probabilities {pij{t))ij^n- Let pij = ri-ij-i for any (i,j) € 
(N*)^. {rij{t))ij^fi are the transition probabilities of the process Y. The Kolmogorov forward 
equations are written for any {i,j) G N^; 

j-i 

r'ijit) = -(^1 +j^i)rij{t) + (j -h l)6»irij+i(t) + ^Xigi,j-kri,k{t). (5) 

k=0 


+ OD 

Let Gi{z) = be the generating function of the distribution of the sizes of incoming limit 

i=i 

orders at the best limit. By multiplicating this equation by and summing over j, we obtain that 

OO 

the generating function ipi{z,t) = ''^^rij{t)z^ is solution of the partial differential equation: 

j=0 


0 = ^iz,t) - 01(1 - z)^{z,t) + Ai(l - Gi(z)}(pi(z,t}, 


( 6 ) 


subject to the initial condition ipi{z, 0) = z^. 
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As for the stationary distribution (7r2,i)i6is}*, we use the Markovian setting described in section 
[2]: A 2 > 0 is the rate of arrival of limit orders, ( 5 ' 2 ,n)nGN is the distribution of their sizes, and 1/02 
is the average lifetime of a share standing inside the book. Similarly to what has been assumed for 
the best quote, we assume that the last share cannot be cancelled, so that the size of the queue 
does not drop to zero (it is by definition the next-non empty limit of the order book). With these 
assumptions, the size of the book at the second limit is the process {1 + Y 2 {t),t G [0, 00 )} with 
infinitesimal generator 


—A 2 

-^252,1 

^292,2 

^292,3 

^292,A 

02 

— (A 2 -I- 02 ) 

^292,1 

^292,2 

^292,3 

0 

202 

— (A 2 -I- 202) 

^292,1 

^292,2 

0 

0 

302 

~(A2 -|- 302) 

^292,1 


The process Y 2 admits a stationary distribution {p 2 ,i)ief>i, and obviously 7r2,i = P 2 ,i-i- Writing 
the classical balance equations and solving the derived ODE for the generating function '4>{z) = 

+ 00 

P 2 ,nz"‘, we obtain : 

n=0 

A fz 1 —G2(^i) j 

= p2,oe'^ ° “ ( 8 ) 

00 

where G 2 {u) = '^^ 02 ,iU^ is the generating function of the distribution of the sizes of limit orders 

n=0 

submitted inside the book. 

Therefore, if we specify the distributions gi and 52 of the sizes of incoming limit orders re¬ 
spectively at the best quote and inside the book, and if subsequent computations are analytically 
tractable, then we can analytically derive the distribution vr. We study two variants of the model 
1 setting: 


Model la: Model la assumes that all limit orders submitted at the best quote or inside the 
book are unit-sized, i.e. gi^i = 52,1 = 1 and gi^n = 92 ,n = 0 for any n > 2. This assumptions 
is the o ne usually made in zero-intelligence models that look for some analytical tractability 
(see e.g. 


Cont et al. 


2OIOI). 


Model lb: Model lb assumes that all limit orders submitted at the best quote or inside the 
book are geometrically distribute d with parameters 0 < < 1 and 0 < ^2 < 1 respectively. 

This assumption has been used in iMuni Tokel (j2014l l in which it has been used to compute a 
general average shape of an order book. 


The results in these two cases are now stated. 


Proposition 2 (Model la). If all limit orders at the best quote or inside the book are unit-sized, 
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then the stationary distribution {7rj)j^fq* of the volume offered at the best quote is: 


oo min(i-lj-l) 


(Ao + ^J'a) E 


i=l 


k=0 


i — 1 




01 


f 


e-(Ao+M)4e-fceii(l _ g-0ity+i-2-2fcg-^(l-e ''l‘) 


and 


, ^ Xo9o,i hA (A2)^ ^ 

//A + Ao /iA + Ao (02y~^(i — 1)1 


( 10 ) 


Proof. The unit-size assumption gives Gi (z) = z, and inserting this in equation ([6]) allows a direct 
solving of the latter as: 


iPiiz,t) = 1 - (1 - 2:)e 


exp 




( 11 ) 


which then gives after some computations, using Leibniz differentiation formula, the transition 
probabilities: 




min(i,j) 

y ^_ 

k\{i-k)\{j-k)\ 





( 12 ) 


Shifting both indices by one, multiplying by hi, summing over i and taking the Laplace transform 
yields equation Q. 

Furthermore, still using the unit-size assumption, the stationary distribution of a process with 

A 2 

generator defined in equation ([7]) is a Poisson distribution with parameter —. This readily gives 

02 

equation (fTOll . □ 

Proposition 3. (Model lb) If all limit orders submitted at the best quote and inside the book 
are i.i.d. and geometrically-distributed with parameter qi and q 2 respeetively, then the stationary 
distribution {7rj)j^^* of the volume offered at the best quote is: 


T^j = (Ao + hA) 




. 1=1 


dl 


Ai 


min(i-l,j-2) 


E 


X 


a=l 

r*+OD 


+ E^ 


i=j 


h 

01 

r+oQ 

/ 

Jo 

u 


k=0 
j—2—k—l 


i — 1 


j-2-k 


E 




j -1-k 


(- 1 )' 




— e 
i — 1 


—d\t\ji—k^—{l+k)9it 


-l-l 


-h (1 - g-(Ao-rM)t 


r* + 00 


e-(i-i)'^it(l _ + (1 - dt 






















and 


J, _ ^090,i , „ (1-92)92 (1 “ ^2)* ^ ^ “*" (1-92)02 ^ 

— . X “r . \ Q 2 


9 A + -^0 9-A + -^G 


G-D' r(^) 


(13) 


Proof. The assumption of a geometric distribution of the sizes of limit orders inside the book gives 


Gi{z) = 


qiz 


. With this dehnition of Gi, we can solve equation ([6]) to obtain: 

qi + {l- gi)(l - z)e-^^^' 


1 - (1 - qi)z 

^,iz,t) = [l-{l-z)e-G^Y 


^1 

9(1-91) 


1 - (1 - qi)z 

The Leibniz differentiation formula and some computations lead to the transient probabilities: 

min(ij-l) 


(14) 






(1-e 




j _^ 


=0 


E 

1 = 

i 

n 

a=l 


q, + (l-q,)e-^^^ 


9i(1-9i) 




Ai 


13=1 


Ai 


- a(l - gi) n ( W + /3(1 - 91 ) 


+ h<i ( j ) - e-Gy-^ qi + {l- 


(15) 


As in the previous case, shifting the indices by one, multiplying by /ij, summing over i and taking 
the Laplace transform yields equation (fT^ . 

Now, as for the stationary distribution of the volume offered at the second limit, the assumption 


of geometrically distributed sizes of limit orders gives G 2 {z) = 
by derivation and after some computations: 


q2Z 


1 - (1 - q2)z 


Vi G N no • - {iAl)92^ 

^ ^1-92)02^ 


, and equation ([H) gives 


(16) 


hence the result. Note that the distribution (/92,i)ieN is a negative binomial distribution (or Polya 


distribution) with non-integer size parameter 


A 2 


Fellen . Il968l . Chap. VL). 


(1 - q 2)62 


and probability parameter q 2 (see e.g. 

□ 


4 Models with both partial and aggressive market orders 

Type-2 models allow for both partial and aggressive market orders to be submitted, i.e. /a > 0 
and fiA > 0. With both types of market orders, the direct approach of the previous section does 
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not provide analytically tractable results. Therefore we propose a different strategy. In this new 
setting, we can keep the analytical tractability of the model at the cost of assuming that classic 
orders directly affecting the best quote are unit-sized. This is the only restriction: orders submitted 
inside the spread or inside the book can be kept with a general distribution, and the aggressive 
market orders are still dehned the same way, obviously, with a size equal to the volume at the 
best quote. With this assumption, the best quote is a birth-and-death process, for which we 
can compute the Laplace transform of its transition probabilities, which can be e xpressed using 
continuous fractions. The original result dates back to 
derivation of the result is found in 


Vlurphy &: O’Donohod (|l975l l but a modern 


Crawford &: SuchardI (|2012l L Therefore, we are able to study the 


stationary distribution of the volume available at the best quote without computing the transient 
probabilities of the process Y with infinitesimal generator m and p > 0 (to our knowledge, such a 
computation is still an unresolved challenge). 

Assume that both partial market orders and limit orders submitted at the best quote are unit¬ 
sized. Then the process Y, which translates the evolution of the volume at the best quote (minus 
one) without any price movement, is a birth-and-death process with constant birth-immigration 
rate Ai and linear death-emigration rate n+n9i for any n > 0. Let {qm,n{t)), (m, n) G N^, t G [0, oo) 
be the transition probabilities of the process Y, and {qm,n{s)), {m,n) G N^,s G C their Laplace 
transform, if it exists. Let (i?n(s))neN the real sequence dehned by the two-step recurrence: 


Bo{s) = 1, Bi{s) = s -h Ai, 

Bn{s) = {s + Xi + fi + {n - 1 ) 01 ) Bn-i - \i{^i + (n- l)0i) Bn- 2 , n > 2. 


(17) 


Then, adapting ICrawford &: SuchardI (j2012l . Theorem 1) to our special case, we have for any 
(m, n) G such that m < n : 


qm,nis) — A]^ 


Oi 


A \n—m hl('S) <l2{s) 0 . 3 ( 5 ) 


61 + 


a2 


^ AC 


b2 + 


bi{s)+ ^ 2 ( 5 )+ ^ 3 ( 5 ) + 


(18) 


63+., 


(The symbol = is the dehnition equality that introduce a simplihed notation for the continuous 
fractions.) In the above equation the sequences (oj)jeN* and are dehned as follows: 


o,; = 


and 


bi = 


Bm{s) if i = 1, 

-Xi{n + {n + l)9i)Bnis) if i = 2, 

if i > 3, 

if i = 1, 

s-|-Ai-|-p-|-(n-|-i — 1)01 if i > 2. 



(19) 


( 20 ) 
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The result for any (m, n) G such that m > n is similarly written: 


Qm,nis) 


n 

\j=n+l 


o:i(s) 0 : 2 ( 5 ) 0 : 3 ( 5 ) 

/3l(s)+ /32('S)+ P3 {s) + 


(21) 


where the sequences and {/3i)i,=fq* are defined as follows: 


and 


(Xi 


Bnis) 

< +{m + l)6i) Bmis) 

-Ai (/U + (m + f - l)0i) 


if i = 1, 
if z = 2, 
if z > 3, 


A 


Bm+i{s) if z = 1, 

s + Ai+^ + (m + z — 1)01 if z > 2. 


( 22 ) 


(23) 


Hence, the Laplace transforms {qm,nis)), {m,n) G N^,s G C are numerically computable using 
appropriate numerical methods for the computation of continuous functions. If we now go back to 
the killing and resurrection of Markov processes, we have the following result. 


Proposition 4 (Type-2 models). If all orders submitted at the best quote are unit-sized, then 
the stationary distribution of the volume offered at the best quote is given by equation z.e. 

00 

= (Ao + hA) ^ hmqm-i,j-i{XQ + 1 za ), where the qm,n’s are given by ([T8]) and ([H]). 

m=l 

If limit orders submitted inside the book are assumed to be unit-sized as well (model 2a), then 
the probabilities hm are given by equation (uni). If the sizes of these limit orders are assumed to be 
geometrically distributed (model 2b), then the probabilities hm are obtained with equation (na). 


5 Empirical results 

We now provide an empirical study that allows the comparison of the models theoretically solved 
in the previous sections. 

5.1 Data and main parameters estimation 

We use Thomson-Reuters tick-by-tick data for eleven stocks traded on the Paris stock exchange, 
from January 4th, 2010 to February 22nd, 2010. The eleven stocks under investigation are: Air 
Liquide (AIRP.PA, chemicals), Alstom (ALSO.PA, transport and energy), Axa (AXAF.PA, in¬ 
surance), BNP Paribas (BNPP.PA, banking), Bouygues (BOUY.PA, construction, telecom and 
media), Carrefour (CARR.PA, retail distribution), Danone (DANO.PA, milk and cereal products), 
Michelin (MICP.PA, tires manufacturing), Renault (RENA.PA, vehicles manufacturing), Sanofi 
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(SASY.PA, healthcare), Vinci (SGEF.PA, construction and engineering). All these stocks are in¬ 
cluded in the CAC 40 French index, i.e. they are among the largest market capitalizations and 
most liquid stocks on the Paris stock exchange. 

For each stock, for each available trading day, we consider the data from 10:00 a.m. to 16:00 
p.m., i.e. a six-hour period at the heart of the trading day. The idea is to get rid of the very busy 
opening and closing period where the assumption of a stationary model may be difficult to fulfill. It 
is well-known that even on the six-hour period considered, one does observe a seasonal activity (the 
U-shaped pattern of financial activity), and one might get better ’’stationary” results by shortening 
the daily period under investigation. However, it will appear that our models actually provides 
surprisingly satisfying results even using the full day sample. For each stock, for each trading day, 
we compute the total numbers and the distributions of the sizes of: limit orders inside the spread; 
limit orders at the best quote; limit orders at the second best limit; partial market orders; and 
aggressive market orders. We also compute the time-weighted empirical distribution of the volume 
offered at the best quote and at the second best limit. 

Straightforwardly, the estimators of the Poisson parameters Ao,Ai,A 2 ,/U and (if needed by 
the model) are defined as the number of the associated events (aggressive limit order, limit order 
at the best quote, limit order inside the book, partial market orders, aggressive market orders) 
divided by the length of the time interval. For each of these types of orders, we also compute 
their respective mean order size ctq, cii, cj 2 , and ci^^. As for the cancellation parameters, we do 
not have any data allowing us to track the submitted orders individually, and therefore we cannot 
easily estimate an average lifetime 0 ^^ and 9^^ for cancelled orders at the best quote and inside 
the book. However, we can get an order of magnitude by using equilibrium relations of incoming 
and outgoing flows of the order book. Our data let us compute the time-weighted average volume 
offered at the best quote Li and at the second best quote L 2 . Then equating the average number of 
incoming and outgoing share in the order book, we set 9i and 62 so that Aicii = fi(T^ + fiAC^HA +^ 1-^1 
and A 2 CT 2 = 02 ^ 2 - 

Finally, following our theoretical framework, we will assume that all partial market orders are 
unit-sized, and therefore rescale all size and volume quantities by the average trade size. The 
rescaling gives us the empirical versions of the distributions ( 50 ,i); (< 72 ,i) and (7r2,j) (if needed 

by the model) with a support roughly included in 1,..., 50 (stock-dependent order of magnitude). 

5.2 Fitted models and benchmarks 

In the previous section we have presented two types of models (namely type-1, without partial 
market order, and type-2, with unit-sized partial market orders), each type having two variants 
(namely, a and b). In variants a, the distribution {g 2 ,i)ien*^ which represents the volume of the 
limit orders submitted inside the book, is a Dirac distribution on the atom 1. In variants b, it is 
geometric with parameter 0 < 92 < 1- The distribution {go,i)ieN*, which represents the volume of 
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the limit orders submitted inside the spread, has not been specihed up to now. In line with the 
other assumptions, we will assume in this empirical section that this distribution is a Dirac on 1 in 
variants a, and a geometric distribution with parameter 0 < go < 1 in variants b. 

We now add further elements of comparison for our models. First, for each type of model, 
we add a variant c in which both distributions (5o,i)ieN* and {g 2 ,i)i£N* are taken equal to their 
empirical counterpart. Furthermore, in order to underline the importance of the mechanism that 
takes into account aggressive orders and consequent upward and downward movements of the best 
price, we recall as benchmark the following simplistic model of the best quote, in which aggressive 
market and limit orders are ignored, which is equivalent to assume a constant price in our setting. 
This benchmark will be referred to as the Type-0 model. In the Type-0 model, limit orders arrive 
at rate Ai with volume distribution j market orders are unit-sized and arrive at rate g ; all 

standing shares have a (i.i.d.) exponential lifetime with parameter 9i > 0. It is thus easily shown 
that the volume at the best quote in the Type-0 model is the Markov process with infinitesimal 
generator given at equation This process admits a stationary distribution (7rj)jeN satisfying 
the following recurrence : 


0 = -XiTTo + {fi + 6i)7ri, 

n 

0 = -{\l+lJL + nei)'n:n + {g + {n + l)9i)'Kn+l + (ra > 1), 


(24) 


2=1 


with 


TTo = 




9i 


du, 


-1 


(25) 


where H{u ) = ^ being the generating function of the distribution {gi^i)i£n (see e.g. 


I — u 

Muni Tokd . l2014l . section 4). We will consider two cases: unit-sized limit orders (Model Oa) and 
limit orders with geometrically-distributed size with parameter qi (Model Ob). In line with the 
general models, one may assume that the last share cannot be cancelled or executed by shifting the 
indices of distribution (7ri)jgN by 1 (i.e. on N*). 

Finally, we add a second benchmark by simulating our general zero-intelligence model of the 
best quote with all distributions of the model equal to their empirical counterpart. This could be 
considered as the ’’best effort” of a zero-intelligence mechanism with both classic and aggressive 
market and limit orders to compute the distribution of the volume offered at the best quote. Note 
that we cannot solve analytically the distribution in this general setting: the distribution in this case 
is numerically estimated by simulation. This simulated benchmark will be referred to as Type-3 
model. 

Tabled] summarizes the models that are under study here. On Figure[T]the empirical distribution 
for the stock AIRP.PA of the size of the limit orders submitted inside the spread is plotted, along 
with its geometric MLE fit. As expected, the geometric fit (used in model lb and 2b) hts well the 


13 







Model type 

{90,i)i&N 

(fl'i,i)ieN 

(52,i)ieN 

(7r2,i)ieN 

/i > 0 ? 

Ao > 0, > 0 ? 

Model Oa 

None 

Unit-size 

None 

None 

Yes 

No 

Model Ob 

None 

Geometric 

None 

None 

Yes 

No 

Model la 

Unit-size 

Unit-size 

Unit-Size 

Poisson 

No 

Yes 

Model lb 

Geometric 

Geometric 

Geometric 

Neg. binomial 

No 

Yes 

Model Ic 

Empirical 

Geometric 

None 

Empirical 

No 

Yes 

Model 2a 

Unit-size 

Unit-size 

Unit-Size 

Poisson 

Yes 

Yes 

Model 2b 

Geometric 

Unit-size 

Geometric 

Neg. binomial 

Yes 

Yes 

Model 2c 

Empirical 

Unit-size 

None 

Empirical 

Yes 

Yes 

Model 3 

Empirical 

Empirical 

None 

Empirical 

Yes 

Yes 


Table 1: Summary of the different models and their variants. Italic means that the distribution 
(7r2,i)jGN (resp. {g 2 ,i)ieN) in these variants is not a free parameter, but a consequence of the choice 
of ( 5 ' 2 ,i)ieN (resp. (7r2,i)ieN)- 




Figure 1: Distribution (g'o,i)ieN (empirical and geometric fit) of the size of limit orders submitted 
inside the spread, main body (left panel) and tail in semi-log scale (right panel). These distributions 
are computed for the stock AIRP.PA, but all stocks studied give similar results. 
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Figure 2: Distribution (7r2,i)iGN (empirical, Poisson obtained with models 1 and 2a and Negative 
binomial obtained with models lb and 2b) of the volume offered at the second best quote, main 
body (left panel) and tail in semi-log scale (right panel). These distributions are computed for the 
stock AIRP.PA, but all stocks studied give similar results. 

main body of the distribution, but its tail is to thin to reproduce the empirical distribution (which 
is used as is in Models Ic, 2c and 3). 

On Figure [2] the empirical distribution for the stock AIRP.PA of the volume offered at the 
second best quote is plotted, along with its modelled counterparts. As expected, the empirical 
distribution is very different from the one obtained using unit-size orders (Models la and 2a): the 
empirical main body is shifted to the left and its tail is fatter. Using geometrically-distributed 
sizes of orders (Model lb and 2b) yields a better fit, with similar defects but of smaller amplitude. 
Models Ic, 2c and 3 will use the empirical distribution as input. 

5.3 Empirical results 

For all the stocks of our sample, we compute the analytical distribution of the volume offered at the 
best quote for all our models and variants, as well as the distribution obtained by simulation of the 
model 3. On figure [3] we compare these analytical distributions with their empirical counterpart. 
For brevity only three stocks are shown, but the results for all stocks are similar. The main result is 
that the killing and resurrecting mechanism used to compute the analytical distributions of models 
of types 1 and 2 is very efficient to produce an empirically-sound shape of the distribution of the 
volume offered at the best quote in an order book. Indeed, all models of type 1 and 2 exhibit a main 
shape close to the empirical one, as well as to the one of model 3. This means that the analytical 
formulas obtained above are a good alternative to compute a distribution that up to now was only 
obtained through simulation. On the contrary, on all stocks, the basic models Oa and Ob fail to 
reproduce the empirical distribution. This outlines the importance of the flows of aggressive limit 
orders and aggressive market orders when describing the volume at the best quote. This quantity 
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Figure 3: Distribution of the volume offered at the best quote for all the models described, 

compared to the empirical one, for three stocks: AIRP.PA (top), BNPP.PA (middle) and MICP.PA 
(bottom). The main body of the distribution is shown on the left column, the tail in semi-log scale 
is shown on the right column. 
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Model 

AIRP.PA 

ALSO.PA 

AXAF.PA 

BNPP.PA 

BOUY.PA 

CARR.PA 

DANO.PA ... 

Oa 

5.76E-002 

6.23E-002 

5.08E-002 

7.10E-002 

7.06E-002 

6.41E-002 

5.74E-002 ... 

Ob 

3.32E-002 

4.00E-002 

4.27E-002 

6.08E-002 

4.45E-002 

3.71E-002 

3.30E-002 ... 

la 

3.47E-003 

3.12E-003 

9.81E-004 

8.57E-004 

4.66E-003 

4.65E-003 

1.84E-003 ... 

lb 

1.47E-003 

2.03E-003 

3.28E-003 

4.65E-003 

2.31E-003 

1.57E-003 

5.98E-004 ... 

Ic 

1.55E-003 

2.77E-003 

1.13E-003 

8.33E-004 

2.93E-003 

9.46E-004 

1.02E-003 ... 

2a 

5.02E-003 

6.38E-003 

5.58E-003 

4.04E-003 

6.42E-003 

4.50E-003 

4.65E-003 ... 

2b 

8.40E-003 

6.37E-003 

2.78E-003 

3.54E-003 

8.18E-003 

8.56E-003 

5.81E-003 ... 

2c 

8.52E-003 

7.43E-003 

4.88E-003 

4.20E-003 

8.57E-003 

7.64E-003 

5.41E-003 ... 

3d 

3.27E-003 

1.06E-002 

1.53E-003 

1.79E-003 

9.07E-003 

4.66E-003 

7.37E-003 ... 


Model 

... MICP.PA 

RENA.PA 

SASY.PA 

SGEE.PA 

Average 

Rank 

Oa 

... 7.20E-002 

5.56E-002 

4.86E-002 

6.73E-002 

6.16E-002 

9 

Ob 

... 4.99E-002 

3.42E-002 

3.68E-002 

3.60E-002 

4.07E-002 

8 

la 

... 6.85E-003 

2.28E-003 

2.50E-003 

4.78E-003 

3.27E-003 

3 

lb 

... 2.73E-003 

9.57E-004 

4.49E-003 

1.41E-003 

2.32E-003 

2 

Ic 

... 3.19E-003 

1.42E-003 

2.39E-003 

1.14E-003 

1.76E-003 

1 

2a 

... 5.41E-003 

4.80E-003 

1.66E-003 

5.55E-003 

4.91E-003 

5 

2b 

... 9.08E-003 

6.06E-003 

4.05E-003 

1.04E-002 

6.66E-003 

6 

2c 

... 9.86E-003 

7.13E-003 

4.20E-003 

9.12E-003 

7.00E-003 

7 

3d 

... 1.30E-003 

5.46E-003 

1.42E-004 

6.96E-003 

4.74E-003 

4 


Table 2: Distance in L^-norm between the distribution of the volume at the best quote predicted 
by the various models and the empirical observation. Last column ranks the models from the best 
to the worst ht. 


cannot be modelled by a simple queue with limit order, partial cancellation and partial market 
order. 

Furthermore, as expected, large differences between the models are observed when looking at 
the tail of the distribution. Variants a, with unit-sized orders, exhibit the thinner tails. The 
geometric distributions for the size of orders at the best quote and in the book (variants b) allows 
for slightly fatter tails, but still much thinner than the empirically observed one. Very interestingly, 
variants c exhibit a tail of the analytical distribution of the volume at the best quote that is as 
fat as the empirical one. This shows that using the empirical distributions for the distributions 
{9o,i)i<=N (aggressive limit orders) and ('7r2,i)igN (after an aggressive market order) is sufficient to 
obtain a fat tail at the best quote, even if the distribution of the size of the limit orders arriving at 
the best quote is thin-tailed. This again underlines the prime importance of the aggressive order 
flows modelled with the killing and resurrecting mechanism in our models. 

We further compare the models by computing the distance (in L^-norm) between the empirical 
distribution of the volume at the best quote and its analytical/numerical counterpart in all the 
models described, on all the stocks of the sample. Then we compute an average distance on the 
sample, and rank all the models from the best to the worst fit. Table [2] presents all the numerical 
results. Several observations can be made. Firstly, the quality of the ht of models of type 1 is in 
average better that models of type 2. Recall that models of type 1 have only aggressive market 
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orders (no partial market orders) but allow for a flexible (geometric) size of limit orders at the 
best quote, while models of type require that all limit orders at the best quote are unit-sized. 
This observation underlines the benefit that one can get by allowing for a general size of orders 
in a limit order book model. Secondly, we may also observe a bit surprisingly that models of 
type 1 even provide a better fit of the volume at the best quote than the model 3, which might 
appear counter intuitive since model 3 is the result of a simulation that take all zero-intelligence 
parameters available while others models are limited. This warns us that many mechanisms occur 
during the fitting of the data, that are more or less hidden but nevertheless important. In particular, 
the average distribution of the volume at the best quote is computed in our examples as a time- 
weighted quantity. The influence of the time is then integrated in the model through the way we 
fit the cancellation parameters as described in subsection 15.11 Indeed, the cancellation parameters 
are not directly fitted but are a sort of adjustment variables. It turns out that models of type 1 
do a better job at integrating this dimension. As an illustration of this phenomenon, we redo the 
whole empirical part without taking into account the time while computing the average volume 
offered at the best quote. This means we work in ’’event time”, which in other words is equivalent 
to considering that the timestamps of the data are deleted and randomly regenerated in the same 
order with exponential inter arrival times. In this setting, model 3 shows as expected the best 
performance: its average L^-distance to the empirical distribution is equal to 2.16 x 10~^, while 
models of types 1 and 2 follow at a distance of 4.27 x 10~^ and more. 

6 Conclusion 

This paper has shown that basic zero-intelligence models of the limit order book are able to ac¬ 
curately describe the stationary distribution of volume offered at the best quote providing they 
include a proper mechanism to take into account aggressive orders that move the price: aggressive 
limit orders submitted inside the spread, as well as aggressive market orders that remove the whole 
liquidity available at the first level of the book. Here, we have modelled these aggressive orders 
using results on killing and resurrecting a Markov process (here, the evolution of the quantity at 
the best quote when the price is constant), which allows us to provide analytical formulas for the 
distribution of interest. 

We end this section by one last observation that may trigger future work on this subject. Figure 
[2] show that the theoretical models for (vr 2 ,i)i 6 N have the same defects in modelling the empirical 
distribution of the volume inside the book as Type-0 models with respect to the volume at the 
best quote: namely, a body shifted to the right and a tail too thin. This is not surprising as the 
volume inside the book as been treated here as one entity, without taking into account killing and 
resurrecting (as in models of Type-0). Future work may include the killing and resurrecting as a 
cascading effect: when the price move, all the limits in the book are shifted, i.e. are killed and 
resurrected. 
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